# Multi-scenario Adaptation
Devstral Small 2505 GGUF
Apache-2.0
Quantized version of Devstral-Small-2505, offering multiple precision options to adapt to different hardware requirements
Large Language Model Supports Multiple Languages
Antigma
170
1
Ultravox V0 5 Llama 3 2 1b GGUF
MIT
Ultravox v0.5 is an audio-to-text model built on the Llama-3.2-1B architecture, focusing on efficient speech transcription tasks.
Speech Recognition
ggml-org
421
1
AM Thinking V1 GGUF
Apache-2.0
AM-Thinking-v1 is a text generation model distributed in the GGUF format, suitable for various natural language processing tasks.
Large Language Model
Transformers
Mungert
1,234
1
Andrewzh Absolute Zero Reasoner Coder 7b GGUF
llama.cpp quantized version of andrewzh's Absolute_Zero_Reasoner-Coder-7b model, supporting multiple quantization levels, suitable for reasoning and code generation tasks.
Large Language Model
bartowski
1,325
5
Nousresearch.deephermes ToolCalling Specialist Atropos GGUF
DeepHermes-ToolCalling-Specialist-Atropos is a text generation model focused on tool calling, designed to achieve efficient task execution through natural language processing technology.
Large Language Model
DevQuasar
419
1
Allura Org Remnant Glm4 32b GGUF
Apache-2.0
Remnant-GLM4-32B is a 32B-parameter large language model based on the GLM4 architecture, supporting role-playing and conversational interactions, particularly suitable for salamander-related applications.
Large Language Model
bartowski
2,198
2
Multi2convai Quality De Bert
MIT
This is a Bert model optimized for German, focusing on text classification tasks in the quality domain.
Text Classification
Transformers German
inovex
116
0
Huihui Ai.glm 4 9B 0414 Abliterated GGUF
GLM-4-9B-0414-abliterated is a large language model with 9B parameters based on the GLM architecture, suitable for text generation tasks.
Large Language Model
DevQuasar
3,172
3
TRELLIS Text Xlarge
MIT
TRELLIS Text XL is a large-scale text-conditioned 3D generation model that can generate corresponding 3D content based on the input text.
Text-to-Image English
microsoft
7,177
12
TRELLIS Text Large
MIT
TRELLIS Text Large is a large-scale text-to-3D generation model that enables scalable and diverse 3D content generation based on a structured 3D latent space.
Text-to-Image English
microsoft
5,049
3
Huihui Ai DeepSeek R1 Distill Llama 70B Abliterated GGUF
GGUF quantized version of DeepSeek-R1-Distill-Llama-70B-abliterated, suitable for local inference, offering multiple quantization options to meet different hardware requirements.
Large Language Model
bartowski
7,848
25
Vitpose Plus Base
Apache-2.0
ViTPose is a vision Transformer-based human pose estimation model that achieves an outstanding performance of 81.1 AP on the MS COCO keypoint detection benchmark with a simple design.
Pose Estimation
Transformers English
usyd-community
22.26k
10
Vitpose Base
Apache-2.0
A vision Transformer-based human pose estimation model achieving an outstanding performance of 81.1 AP on the MS COCO keypoint test set
Pose Estimation
Transformers English
usyd-community
761
9
Summllama3.1 8B GGUF
An 8B-parameter summarization model built on the Llama 3.1 architecture, offering multiple quantization versions
Large Language Model
tensorblock
52
0
Sam2 Hiera Tiny.fb R896 2pt1
Apache-2.0
SAM2 model based on the HieraDet image encoder, focusing on image feature extraction tasks.
Object Detection
Transformers
timm
37
0
Sam2 Hiera Base Plus.fb R896 2pt1
Apache-2.0
SAM2 model weights based on HieraDet image encoder, focused on image feature extraction tasks
Image Segmentation
Transformers
timm
148
0
Moonshine Base ONNX
MIT
ONNX-format automatic speech recognition model based on the Moonshine base model, supporting efficient inference
Speech Recognition
Transformers
onnx-community
1,171
29
Wavlm Large Finetuned SER
A speech emotion recognition model based on WavLM-Large, supporting English speech emotion classification.
Audio Classification English
JBJoyce
139
0
Jenna Ortega Flux
Other
A LoRA model customized based on the FLUX.1-dev foundation model, specializing in generating realistic-style portraits of Jenna Ortega.
Text-to-Image
Keltezaa
1,452
4
Pathumma Whisper Th Large V3
Apache-2.0
Pathumma Whisper Large V3 is a Thai automatic speech recognition model based on the OpenAI Whisper architecture, supporting Thai and English speech transcription tasks.
Speech Recognition
Transformers Supports Multiple Languages
nectec
352
4
Allegro
Apache-2.0
Allegro is an open-source high-quality text-to-video generation model capable of producing 6-second detailed videos at 720x1280 resolution and 15 FPS.
Text-to-Video English
rhymes-ai
250
257
Belle Whisper Large V3 Turbo Zh
Apache-2.0
A Chinese speech recognition model fine-tuned based on whisper-large-v3-turbo, showing significant performance improvements in multiple Chinese speech recognition benchmarks
Speech Recognition
Transformers
BELLE-2
2,891
55
Pgtformer Base
PGTFormer is an image-to-image transformation model based on PyTorch, integrated and pushed to Hugging Face Hub via PytorchModelHubMixin.
Image Generation
Safetensors
kepeng
151
4
Reverb Diarization V2
Other
Reverb Speaker Diarization V2 is a speaker diarization model based on pyannote-audio, outperforming the baseline pyannote3.0 model on multiple test sets.
Audio Processing
Revai
4,073
45
Base ZhEn
This model is used to convert image content into textual descriptions and is suitable for non-commercial purposes.
Text Recognition
MixTex
50
0
Add Detail Xl
add-detail-xl is a detail adjustment model for SDXL. It can increase or decrease image details by adjusting the weight, bringing more flexibility to image generation.
Image Generation
LyliaEngine
327
4
Moralbert Predict Subversion In Lyrics
MIT
This is a PyTorch-based text classification model suitable for various text classification tasks.
Text Classification
Transformers
vjosap
17
1
Image Captioning Vit Gpt2 Flick8k
Apache-2.0
This model can convert input images into descriptive text, suitable for image understanding tasks in various scenarios.
Image-to-Text
Transformers
pltnhan311
18
0
Detr Face Detection
Openrail
A face detection model released under the CreativeML-OpenRAIL-M license, supporting the English language, primarily used for object detection tasks.
Object Detection
Transformers English
diffusionai
108
1
Final Model
Apache-2.0
This model is an image-to-text model released under the Apache-2.0 license, capable of converting image content into textual descriptions.
Text Recognition
Transformers
goatrider
17
0
Gemma 7b Finetuned
MIT
A prompt optimization model fine-tuned using the QLoRA method, specifically designed to enhance the clarity and effectiveness of text prompts.
Large Language Model
Transformers
zamal
52
5
Parrots Chinese Hubert Base
Apache-2.0
The Chinese HuBERT base model is a pre-trained model for text-to-speech tasks, supporting Chinese speech processing.
Speech Synthesis
Transformers Chinese
shibing624
35
1
Parrots Chinese Roberta Wwm Ext Large
Apache-2.0
Chinese pre-trained model based on RoBERTa architecture, supporting text-to-speech tasks
Large Language Model
Transformers Chinese
shibing624
76
2
Imagecaptioningtransformers
Apache-2.0
This model can convert input images into descriptive text, suitable for various image content understanding tasks across multiple scenarios.
Image Generation
Transformers
adityarajkishan
13
0
Openbuddy Deepseek 10b V17.1 4k GGUF
Other
OpenBuddy is an open-source multilingual chatbot that supports communication in multiple languages.
Large Language Model Supports Multiple Languages
LoneStriker
82
3
Belle Distilwhisper Large V2 Zh
Apache-2.0
A Chinese speech recognition model fine-tuned from distilwhisper-large-v2, running 5.8 times faster than whisper-large-v2 with 51% fewer parameters
Speech Recognition
Transformers
BELLE-2
230
37
Whisper Large V3 French Distil Dec8
MIT
This is a distilled version of the Whisper-Large-V3 French model, optimized for inference speed and memory usage by reducing the number of decoder layers while maintaining good performance.
Speech Recognition
Transformers French
bofenghuang
32
4
Whisper Large V3 French
MIT
A French automatic speech recognition model fine-tuned based on OpenAI Whisper-large-v3, supporting case sensitivity, punctuation, and number prediction
Speech Recognition
Transformers French
bofenghuang
771
28
Orionstar Yi 34B Chat Llama GGUF
Other
OrionStar Yi 34B Chat Llama is a large language model based on the Yi 34B architecture, focusing on Chinese dialogue tasks.
Large Language Model Other
TheBloke
557
16
Noromaid 20b V0.1.1
Noromaid-20b-v0.1.1 is a large language model suitable for role-playing, emotional role-playing, and general scenarios, jointly developed by IkariDev and Undi.
Large Language Model
Transformers
NeverSleep
145
51
Featured Recommended AI Models